A Turkish Handprint Character Recognition System
Authors
Abstract
This paper presents a study on recognizing isolated Turkish handwritten uppercase letters. First, a Turkish Handprint Character Database was created from samples written by students at Istanbul Technical University (ITU). The database contains about 20000 uppercase letter samples and 7000 digit samples. Several feature extraction and classification techniques are implemented and combined to find the best recognition system for Turkish characters. Features obtained from the Karhunen-Loève Transform (KLT), Zernike Moments, the Angular Radial Transform (ART) and Geometric Features are classified with Artificial Neural Networks, K-Nearest Neighbor, Nearest Mean, Bayes, Parzen and Size Dependent Negative Log-Likelihood (SDNLL) methods. Geometric moments suited to Turkish characters are constructed. KLT features are fused with the other features because KLT gives the best recognition rate but carries no information about the shape of the character, whereas the other methods do. In the experiments, the fused KLT and ART features classified by SDNLL give the best result for Turkish characters.
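The fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the "shape" features are random placeholders standing in for descriptors such as ART or Zernike moments, fusion is assumed to be plain concatenation, and Nearest Mean is used as the classifier because SDNLL is not specified here. All function names are hypothetical.

```python
import numpy as np

def klt_fit(X, n_components):
    """Estimate a KLT (PCA) basis from samples stored as rows of X."""
    mean = X.mean(axis=0)
    cov = np.cov(X, rowvar=False)            # feature covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order]           # keep the leading eigenvectors

def klt_transform(X, mean, basis):
    """Project centered samples onto the KLT basis."""
    return (X - mean) @ basis

def nearest_mean_fit(F, y):
    """Compute one mean feature vector per class."""
    classes = np.unique(y)
    means = np.stack([F[y == c].mean(axis=0) for c in classes])
    return classes, means

def nearest_mean_predict(F, classes, means):
    """Assign each sample to the class with the closest mean (Euclidean)."""
    dists = np.linalg.norm(F[:, None, :] - means[None, :, :], axis=2)
    return classes[np.argmin(dists, axis=1)]

# Synthetic stand-in data (assumption): 200 flattened 8x8 "images", 2 classes.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 100)
pixels = rng.normal(loc=3.0 * y[:, None], scale=1.0, size=(200, 64))
# Placeholder shape descriptors (assumption), 4 values per sample.
shape_feats = rng.normal(loc=2.0 * y[:, None], scale=1.0, size=(200, 4))

mean, basis = klt_fit(pixels, n_components=8)
klt_feats = klt_transform(pixels, mean, basis)

# Fusion assumed here to be simple concatenation of the two feature vectors.
fused = np.hstack([klt_feats, shape_feats])

classes, means = nearest_mean_fit(fused, y)
pred = nearest_mean_predict(fused, classes, means)
accuracy = (pred == y).mean()
print(f"training accuracy on synthetic data: {accuracy:.2f}")
```

Concatenation is only one possible fusion rule. In practice the two feature blocks may need rescaling so that neither dominates the distance computation, and accuracy should be measured on held-out samples rather than the training set.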
Similar resources
Turkish handwritten text recognition: a case of agglutinative languages
We describe a system for recognizing unconstrained Turkish handwritten text. Turkish has agglutinative morphology and theoretically an infinite number of words that can be generated by adding more suffixes to the word. This makes lexicon-based recognition approaches, where the most likely word is selected among all the alternatives in a lexicon, unsuitable for Turkish. We describe our approach ...
A Neural Approach to Concurrent Character Segmentation and Recognition
This paper presents a neural network solution that combines character segmentation and character recognition concurrently as a single task. Current segmentation methods utilize traditional image processing techniques such as spatial histograms which are only 60% accurate on handprint. Using traditional techniques for segmenting handprint in a model recognition system running on a massively para...
Component-based handprint segmentation using adaptive writing style model
Building upon the utility of connected components, NIST has designed a new character segmentor based on statistically modeling the style of a person’s handwriting. Simple spatial features (the thickness of the pen stroke and the height of the handwriting) capture the characteristics of a particular writer’s style of handprint, enabling the new method to maintain a traditional character-level se...
Public domain optical character recognition
A public domain document processing system has been developed by the National Institute of Standards and Technology (NIST). The system is a standard reference form-based handprint recognition system for evaluating optical character recognition (OCR), and it is intended to provide a baseline of performance on an open application. The system’s source code, training data, performance assessment to...
NIST Form-Based Handprint Recognition System
The National Institute of Standards and Technology (NIST) has developed a new release of a standard reference form-based handprint recognition system for evaluating optical character recognition. As with the first release, NIST is making the new recognition system freely available to the general public on CD-ROM. This source code testbed, written entirely in C, contains both the original and th...